Keqin Bao

additional authors not shown

ARES: Automated Rubric Synthesis for Scalable LLM Reinforcement Learning

May 25, 2026
Via arXiv

Unified Data Selection for LLM Reasoning

May 21, 2026
Via arXiv

SAGE: Scalable Automated Robustness Augmentation for LLM Knowledge Evaluation

May 12, 2026
Via arXiv

SkillGraph: Skill-Augmented Reinforcement Learning for Agents via Evolving Skill Graphs

May 12, 2026
Via arXiv

On Predicting the Post-training Potential of Pre-trained LLMs

May 12, 2026
Via arXiv

Towards Sample-Efficient and Stable Reinforcement Learning for LLM-based Recommendation

Jan 31, 2026
Via arXiv

Teaching LLM to Reason: Reinforcement Learning from Algorithmic Problems without Code

Jul 10, 2025
Via arXiv

Boosting Parameter Efficiency in LLM-Based Recommendation through Sophisticated Pruning

Jul 09, 2025
Via arXiv

CoRT: Code-integrated Reasoning within Thinking

Jun 12, 2025
Via arXiv

MTR-Bench: A Comprehensive Benchmark for Multi-Turn Reasoning Evaluation

May 26, 2025
Via arXiv